Skip to content

Conversation

@v-shobhit
Copy link
Contributor

Adding gpt-oss-120b reference implementation.
This uses SGLang backend to serve gpt-oss-120b model. Scripts are provided to run this in Offline/Server scenario in PerformanceOnly/AccuracyOnly modes.



def decode_hex_to_tokens(hex_data: str) -> List[int]:
"""Decode hex string to list of token IDs (int32).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not a concern here - we added int32 after int64 didnt work so ds-r1 code has support for both while defaulting to int32:


total_correct = sum(stats["correct"] for stats in dataset_stats.values())
total_samples = sum(stats["total"] for stats in dataset_stats.values())
overall_accuracy = (
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

consider putting formulas on one line, if block on next

similar below (final-score-percentage)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IIUC these are linux soft links?

Copy link
Contributor Author

@v-shobhit v-shobhit Jan 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah that is right

@v-shobhit
Copy link
Contributor Author

@arjunsuresh @mrmhodak can we try to get this merged soon? 🙏

@arjunsuresh
Copy link
Contributor

@v-shobhit once the WG approves I can merge it. I'm told to not do any merge outside of the WG discussions 😕

mrmhodak
mrmhodak previously approved these changes Jan 6, 2026
@hanyunfan hanyunfan dismissed nvzhihanj’s stale review January 6, 2026 17:44

Zhihan confirmed this isn’t an issue and will address it after the merge, if needed.

@hanyunfan
Copy link
Contributor

@anandhu-eng will create an empty commit as a workaround for this checker issue. This problem ultimately needs to be resolved by the Automation Work Task Force.

@arjunsuresh
Copy link
Contributor

PR is ready to be merged once approved.

@hanyunfan hanyunfan self-requested a review January 6, 2026 20:02
Copy link
Contributor

@hanyunfan hanyunfan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As we talked in WG, this is approved.

@hanyunfan hanyunfan merged commit 5523589 into mlcommons:master Jan 6, 2026
16 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Jan 6, 2026
@hanyunfan
Copy link
Contributor

@v-shobhit merged. thanks for all the hard work!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants